비디오 세그먼트 단위의 부분 복사 검출을 위한 CNN 기반 프레임 특징 벡터 융합 방법

최정환; 최지원; 류덕산; 김순태; Jeongwhan Choi; Jiwon Choi; Duksan Ryu; Suntae Kim; 정민수; 낭종호; Minsoo Jeong; Jongho Nang

연구문헌

국내 논문지

홈 > 연구문헌 > 국내 논문지 > 한국정보과학회 논문지 > 정보과학회논문지 (Journal of KIISE)

정보과학회논문지 (Journal of KIISE)

Current Result Document :

한글제목(Korean Title)	비디오 세그먼트 단위의 부분 복사 검출을 위한 CNN 기반 프레임 특징 벡터 융합 방법
영문제목(English Title)	A Fusion of CNN-based Frame Vector for Segment-level Video Partial Copy Detection
저자(Author)	최정환 최지원 류덕산 김순태 Jeongwhan Choi Jiwon Choi Duksan Ryu Suntae Kim 정민수 낭종호 Minsoo Jeong Jongho Nang
원문수록처(Citation)	VOL 48 NO. 01 PP. 0043 ~ 0050 (2021. 01)
한글내용 (Korean Abstract)	최근 유튜브나 인스타그램과 같은 콘텐츠 플랫폼을 주축으로 미디어에 대한 수요가 급속하게 증가하고 있다. 이에 따라 저작권 보호나 불법 콘텐츠의 유포와 같은 문제들이 발생하고 있다. 이러한 문제를 해결하기 위해 내용에 기반한 고유의 식별자를 추출하는 방법들이 제안되었지만 기존의 연구들은 미리 정해진 변형에 대하여 고안되었기 때문에 실제 비디오에서는 검출에 실패하였다. 본 논문에서는 실제 유통되는 비디오의 다양한 변형에 강인한 부분 복사 검출을 위해 프레임 정보를 융합한 딥러닝 기반의 세그먼트 Fingerprint를 제안한다. TIRI를 이용한 데이터 수준의 융합 방법과 풀링을 이용한 특징 벡터 수준의 융합 방법으로 추출한 Fingerprint를 Triplet loss를 이용하여 학습하고 검출 시스템을 설계하여 성능을 분석한다. 본 논문의 실험은 유튜브를 기반으로 수집한 데이터셋인 VCDB를 이용하였으며 5초 동안 샘플링한 프레임 특징 벡터를 Max 풀링으로 융합하여 66%의 성능을 얻었다.
영문내용 (English Abstract)	Recently, the demand for media has grown rapidly, led by multimedia content platforms such as YouTube and Instagram. As a result, problems such as copyright protection and the spread of illegal content have arisen. To solve these problems, studies have been proposed to extract unique identifiers based on the content. However, existing studies were designed for simulated transformation and failed to detect whether the copied videos were actually shared. In this paper, we proposed a deep learning-based segment fingerprint that fused frame information for partial copy detection that was robust for various variations in the actually shared video. We used TIRI for data-level fusion and Pooling for feature-level fusion. We also designed a detection system with a segment fingerprint that was trained with Triplet loss. We evaluated the performance with VCDB, a dataset collected based on YouTube, and obtained 66% performance by fusing frame features sampled for 5 seconds with Max pooling for detecting video partial-copy problems.
키워드(Keyword)	컨피규레이션 버그 리포트 선형판별분석 차원축소 클래스 불균형 샘플링 configuration bug report linear discriminant analysis dimensionality reduction class imbalance sampling 비디오 비디오 복사 검출 비디오 부분 복사 특징 융합 CNN 딥러닝 CNN video analysis copy detection partial-copy detection feature fusion deep learn
파일첨부	PDF 다운로드